Epidemic threshold for the SIS model on random networks 
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We derive an analytical expression for the critical infection rate r c of the susceptible-infectious- 
susceptible (SIS) disease spreading model on random networks. To obtain r c , we first calculate the 
probability of reinfection, n, defined as the probability of a node to reinfect the node that had earlier 
infected it. We then derive r c from n using percolation theory. We show that n is governed by two 
effects: (i) The requirement from an infecting node to recover prior to its reinfection, which depends 
on the disease spreading parameters; and (ii) The competition between nodes that simultaneously 
try to reinfect the same ancestor, which depends on the network topology. 

PACS numbers: 



Diseases spread in a population as infected individu- 
als infect other individuals with whom they are in con- 
tact. In recent years, several models have been devel- 
oped to mathematically characterize the spread of dis- 
eases [1-3]. The disease spreading is best modeled as a 
sparse network, where the individuals are the nodes and 
the contacts are the links connecting them [4|-|6[. Two 
epidemic models of particular importance are the SIR 
(susceptible/infectious/recovered) Q and SIS (suscepti- 
ble/infectious/susceptible) (8l-floj]. In these models, the 
TV network nodes are initially susceptible (they currently 
do not have the disease, but might become infected later), 
except for one randomly chosen infected node. At each 
time step, an infected node infects each of its neighbors 
with probability r. Infected nodes remain as such for 
r time steps, after which they become either recovered 
(cannot further infect or become infected) in SIR or sus- 
ceptible again in SIS. Hence, in SIS individuals can be 
infected multiple times. 

A fundamental question in the study of epidemics is: 
will a disease spread throughout the population, or will it 
die out? The answer to this question depends on the val- 
ues of the infection and recovery rates, as well as on the 
nature of the connections between the individuals. For 
the SIR model on random networks, the epidemic thresh- 
old and the critical infection rate above which the disease 
infects a non-zero fraction of the population were previ- 
ously derived d,[ll, 12 1. However, for the SIS model, the 
calculation of the critical infection rate is significantly 
more involved due to the possibility of multiple infections 
of the same node. 

Analytical results for SIS on networks exist for a num- 
ber of specific network models (e.g., a circle, a lattice, a 
double- layered fully connected network, etc. 13- A 
mean-field analysis of SIS on general random networks 
has been proposed in 13, LL8[ • In this approach, master 
equations are written for the number of infected individ- 
uals of a given degree. To calculate the epidemic thresh- 
old, the mean-field approach assumes that an infected 
individual has an equal probability to infect each of its 
neighbors. However, as we show below, the probability 



of an individual to reinfect the neighbor that had previ- 
ously infected it (the reinfection probability) is different 
from the probability to infect the other neighbors. 

In this Letter, we derive an analytical expression for 
the SIS epidemic threshold. We find that the critical in- 
fection rate at the threshold is higher than the mean- field 
approximation for SIS but lower than the critical infec- 
tion rate for SIR. Simulation results support the accuracy 
of our expression. To obtain the epidemic threshold, we 
first calculate the reinfection probability which depends 
on both the disease spreading parameters and the net- 
work topology. Then, using percolation arguments we 
derive the threshold from the reinfection probability. 

To derive the epidemic threshold, some definitions 
from percolation theory are needed. In a percolation pro- 
cess on a network, links are removed until only a fraction 
p of the N network nodes remain. This is continued until 
a critical value, the percolation threshold p c , is reached. 
For p > p c , a spanning cluster of order N nodes exists, 
while for p < p c the network collapses into small clusters 

i3-ii. 

Disease spreading can be seen as a growing percolation 
process 2J|, in which, starting from a given seed, links 
are being added to the growing network with probability 
p. The critical infection rate in which the disease spreads 
throughout the network is equivalent to the percolation 
threshold in which a spanning cluster appears. To com- 
plete the mapping, we specify the epidemic equivalent of 
p, that is, the probability of a node to infect its neighbor. 
This probability is different from r, since an infected node 
i (see Fig. [1} can infect its neighbor j only as long as it 
is still infected, for t time steps. Therefore, the desired 
probability is given by fl2| : 



p= 1 - (1 -r) 



(1) 



The critical infection rate r c for a given r can then be 
obtained by substituting p c in Eq. ([1} . 

To find Pc, we define (rij), the average number of sus- 
ceptible nodes infected by an already infected node i. If 
(m) > 1, the disease will keep on spreading until a non- 
zero fraction of the network is covered [25| . In SIR, only 
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Using 7T, an expression for {rii} sis immediately follows: 



FIG. 1: A schematic illustration of the sub-network in which 
the probabilities of infections are calculated in the text. 



the descendants of a node should be taken into account, 
whereas in SIS, the ancestor could also be reinfected. In 
terms of Fig. [TJ node i, once infected, can infect not only 
its descendants ji, j2, ■•■ (hi SIR) but also its ancestor a 
(in SIS). 

The probability of a node that is reached by following 
a link to have degree k (one incoming link and k — 1 
outgoing links) is kP(k)/ (k) where P(k) is the degree 
distribution of the network nodes. Therefore, as long 
as the network has a tree like structure and loops are 
negligible [23[ , the average number of neighbors infected 
by node i for SIR is given by (25} : 



(Ui) 



SIR 



P 
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fcP(fc) 



(fc_l)=p(K-l), 



(2) 



where k — 1 = [(k 2 ) / (k)] — 1 is the branching factor. At 
the epidemic threshold, p c {n — 1) = 1 23]. Writing p c in 
terms of r c and r (Eq. ([1])) we obtain: 

In SIS, the additional link leading to the ancestor 
should be incorporated into the calculation. This can 
be accomplished by making use of an effective branch- 
ing factor, obtained by replacing k — 1 by k (the mean 
field approach). For example, consider node j\ in Fig. [T] 
in addition to the k — 1 links that are connected to the 
descendants d\, i, it also has the link to i. 

However, this approach is too simplistic: the proba- 
bility of j\ to reinfect its ancestor is different from its 
probability to infect its descendants. This is true since 
as opposed to the descendants, i is not necessarily suscep- 
tible. This is a combined outcome of two effects. First, 
since i must have been sick before infecting ji in the first 
place, ji can only reinfect i after i has recovered and 
became susceptible again (the "time" factor). Moreover, 
one of the other descendants of i (j 2 , j K -i hi Fig. [T]) 
might reinfect i before ji does (the "neighbors" factor). 
Below, we derive an analytical expression for the prob- 
ability of reinfection n, taking into account these effects 
and comparing it to the mean field approach. 



(4) 



The first term is the SIR branching factor (Eq. ©) and 
the second is the SIS-specific contribution of the proba- 
bility to reinfect the ancestor. 

Our analysis depends on the assumption that the de- 
scendants of an infected node d\, d K _i are susceptible 
and therefore the probability to infect them is p. This 
assumption is commonly used in epidemic models when 
studying disease spreading (e.g., [y, [HI), but limits the 
validity of the results to the region of at or below the 
epidemic threshold. For SIR this assumption is trivially 
true since reinfections are not permitted and the disease 
spreads directionally down a tree structure. However, 
for SIS where reinfections are allowed, the disease can 
spread down and back up the same branch, and thus the 
neighbors of a just infected node may already be infected. 
Nevertheless, since at or below the threshold the infec- 
tion rate is already low, the probability that the disease 
has spread down and then back up is extremely small 

To calculate ir we first obtain the probability 7r t of j\ 
to infect i, considering the "time" factor and ignoring the 
"neighbors" factor. Assume node i has been infected for 
s time steps before infecting ji . Since the total lifetime of 
the disease is r time steps, i remains infected for (r — s) 
time steps after infecting j\. Therefore, the total time in 
which ji is infected and i is not is r — (r — s) = s. The 
desired probability ix t is obtained by conditioning on s: 



(i - r y 



^ i 

s=l 



(1 - r) 



r 1 — (1 — r) T+1 



where (1 



\s-l 



r/[l — (I — r) T ] is the probability that i 



infected ji at step s given that ji was eventually infected, 
and [I — (1 — r) s ] is the probability that ji infected i in 
at most s steps. 

To complete the derivation of 7r, we incorporate the 
effect of competition (which node is the first to reinfect 
i) between ji and the other descendants of i. Since i 
is arrived at by following a link from a, it will have on 
average k— 1 descendants, ji, ...,j K —±. In addition, i can 
also be reinfected by its ancestor a. Therefore (27] : 



7Ti 



-l 
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( piT t ) k (I -pKtT 1 k 

k' + 1 



(6) 



To see this, recall that 7r 4 is the probability of j\ 
to infect i, independently of its siblings a, ji, ...,j K -i. 
Thus, pirt is the probability of a sibling of j\ to 
be infected by i, and then reinfect i. From this, 

it C^ 1 ) (p^t) k (1 — pnt) K 1 k is the probability that ex- 
actly kl siblings of j\ , in addition to j\ itself, have suc- 
ceeded to reinfect i. However, after the first out of these 
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k 1 + 1 nodes have reinfected i, none of the other k' nodes 
can do so. Since all of these nodes are a-priori equivalent, 
the probability of ji to successfully reinfect i is obtained 
by dividing by k' + 1. Finally, the desired probability 7r 
is obtained by summing over all possible values of k' . 

Next, we compare simulation results for the reinfec- 
tion probability with the three approximations presented 
above: (i) p - the reinfection probability assuming it is 
equal to the probability to infect any other node (mean 
field approach); (ii) ix t - the reinfection probability tak- 
ing into account only the "time" effect (Eq. ©); (in) n 
- the reinfection probability according to Eq. . Recall 
that due to the assumption explained above, the equa- 
tion for 7r is expected to hold only at or below the epi- 
demic threshold. Fig[2] shows that while Tr t is a better 
estimator of the reinfection probability compared to the 
mean-field approximation, our Eq. ([6J is superior to both 
and agrees with the simulation results up to the epidemic 
threshold (indicated with an arrow). For increasing val- 
ues of r (Figs. EJa) and (c)) two opposing effects occur: 
While p by itself grows with t, increasing the probability 
of reinfection, the growth of r also reinforces the "time" 
and "neighbors" effects, thus attenuating the increase in 
7T. An increase in r (Figs. EJb) and (d)) has a simi- 
lar outcome. As r increases, p increases. In addition, 
large values of r imply also that j%, j K -x, a are more 
probable to infect i before ji, thus decreasing the total 
reinfection probability 7r. 

Our analysis has so far been general for all random net- 
works with a prescribed degree distribution P(k). Two 
specific models are of special interest. The first is the 
Erdos - Renyi (ER) network model 21, 22,113, in which 
all links exist with equal probability cf>, leading to a Pois- 
son degree distribution P(k) = e — W (k) k jk\ with aver- 
age degree (k) = (N — 1)0. The ER network model has 
become a classic model in random graph theory and was 
intensively studied in the past few decades. The other 
model is that of scale free networks (SF) 29§] : networks 
with a broad degree distribution, usually in the form of 
a power-law, P(k) ~ fc~ 7 with 7 > 2. It was found that 
many natural networks are scale- free [29| . 

Using our results, the formulation of the critical infec- 
tion rate for the SIS epidemic model on both ER and SF 
networks is straightforward. At the threshold, 



p(r c )(n - 1) + 7r(r c ) = 1. 



(7) 



For ER networks, « - 1 = (k 2 ) / (k) - 1 = (k). Thus, 
the equation for the critical infection rate is p(r c ) (k) + 
7r(r c ) = 1. For SF networks, the value of k — 1 depends 
on the degree exponent 7. Theoretically, K diverges for 
infinite SF networks with 2 < 7 < 3, leading to the trivial 
solution r c — 0. In practice, even for 7 < 3 finite net- 
works have finite degrees and thus the epidemic threshold 
is non-zero [30| . 

Simulation results for the critical infection rate r c are 
presented in Fig. [31 with the theoretical values of r c ob- 
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FIG. 2: Simulation and theory for the reinfection probabil- 
ity. Three approximations: p (dot-dashed lines) ,-Kt (dashed 
lines), 7r (solid lines) are compared with simulation results (see 
text). Simulation results (circles) were obtained by recording 
the probability of a infected node to reinfect its ancestors, av- 
eraging over many network and disease configurations. The 
simulations presented are for ER networks but were validated 
also for SF networks. The epidemic threshold is indicated 
with an arrow, and is calculated from Eq. 0. (a) K = 2.1 
r = 0.15 and increasing values of r. (b) k = 2.1 r = 2 and 
increasing values of r (c) k = 5 and r — 0.03. (d) k = 5 and 
t = 2. 



tained by numerically solving Eq. ([7]) for given r and k. 
In Fig. E^a) we plot theory and simulations for r c vs. r 
for ER and SF networks. In Fig. |3jb) we compare our 
theory, the mean-field approach, and simulation results. 
For large values of r, the the mean-field approach devi- 
ates by upto 10 percent, while our theory is accurate 
(see note [31(). The inset of Fig. [31(b) compares SIR and 
SIS. 

The SIS epidemic model we analyzed has been limited 
to the case of a fixed recovery time. This is because, 
to calculate the epidemic threshold, we based our the- 
ory on the ma ppi ng between bond percolation and the 
SIR model " 
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25] . However, it has recently been shown 
that this mapping is valid only when the recovery time 
is fixed, whereas variable recovery time introduces cor- 
relations not present in the bond percolation model [32| . 
Nevertheless, we argue that our reinfection probability 
7r is an upper bound for the reinfection probability with 
variable recovery time. This can be shown by simulations 
as well as by the following intuitive argument. Consider 
the simple case when only the time effect is taken into 
account and n = 7r f = [1 — (1 — r)( T+1 )]/(2 — r). For a 
fixed recovery rate, r is a random variable, geometrically 
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FIG. 3: Simulation results for the critical infection rate, (a) 
Simulation results of r c vs. r for ER networks with (k) = 2, 4 
and SF networks with 7 = 3.6 (symbols), compared to theory 
(lines) as obtained from Eq. iff}, (b) Theoretical values of 
r c (Eq. (O; solid line) compared to the mean-field approach 
(obtained by setting n = p in Eq. 0; dashed line) and 
simulation (circles) for (fc) = 4. The inset compares r c (theory 
and simulations) between SIR (circles) and SIS (squares) with 
(k) = 2. Theoretical values (lines) were obtained from Eq. (|3]) 
for SIR and from Eq. (J7J for SIS. The epidemic threshold was 
calculated using a method of percolation theory: denote by S 
the total number of infections; r c was identified as r for which 

distributed with mean (r). Noting that ^| < for all r, 
it follows from Jensen's inequality that (tt(t)) < 7r((r}). 

In summary, we analyzed the propagation of diseases 
on random networks according to the SIS model. The 
presence of reinfections in the model poses a great chal- 
lenge since it can no longer be mapped into a simple 
branching process. We overcame this problem by calcu- 
lating the reinfection probability. With this probability, 
we could recast the problem back into a branching pro- 
cess with an effective branching factor that takes into 
account not only the descendants of a node, but also its 
ancestor, weighted by the probability of reinfection. The 
condition for an outbreak was then found using the rein- 
fection probability and percolation arguments. 
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